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Abstract 

We consider binary rotation words generated by partitions of the unit 
circle to two intervals and give a precise formula for the number of such 
words of length n. We also give the precise asymptotics for it, which 
happens to be 0(n 4 ) . The result continues the line initiated by the formula 
for the number of all Sturmian words obtained by Lipatov in 1982, then 
independently by Berenstein, Kanal, Lavine and Olson in 1987, Mignosi 
in 1991, and then with another technique by Berstel and Pocchiola in 
1993. 



1 Introduction 

Infinite words arising from rotations of the circle belong to the same family of 
infinite words defined by the means of dynamical systems as Sturmian words and 
interval exchange words in general. They were considered by G. Rote in 1992 
pTj and can be defined using three parameters a, /?, 7 G [0, 1) as r = r^ri • • • , 
where for all i we have 



Ti = 



fl, if {ia} G [/9 j7 ), 
I 0, otherwise. 



(1) 



(Here the interval [/3, 7) is denoted as usual if (3 < 7 and as [j3; 1) U [0,7) 
otherwise.) 

In the particular case when 7 — j3 — a (mod 1), w is a Sturmian word. 
The family of Sturmian words is very well studied (see Chapter 2 of [9]); in 
particular, the total number of factors of all Sturmian words taken together is 
known to be 

n 

l + 5^(n-p + %(p), 
P =i 
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where ip is the Euler's totient function. This formula was rediscovered several 
times [8j [H [TUl |3]; the order of growth of this function is 0(n 3 /7r 2 ). 

In [5J [7] Cassaigne and the first author estimated and for some cases found 
the number of factors of length n of all rotation words with a given length 7—/? of 
the interval; it happens that it also grows as 0(n 3 ). In 1 , Ambroz, Masakova, 
Pelantova and the first author estimated the number of all words arising from 
three-interval exchange, which continues the same line since Sturmian words 
are exactly two-interval exchange words; it happens that the number of three- 
interval exchange words grows as 0(n 4 ). In [5], Berstel and Vuillon coded 
rotation words by Sturmian words. 

In this paper, we find a precise formula for the number of all rotation words 
([U, predictably involving sums of the Euler's function. To write down the 
formula, we had to understand very clearly the structure of the set of rotation 
words, which is of independent interest. 



2 Main statement 

The main result of the paper is the following 

Theorem 1 Starting from n — 3, the number of binary rotation words of length 
n + 1 is 

f{n + 1) = n 2 + 3n + 4 + - £ <p(p) (n 2 - p 2 + n + p) - h (n) - 2 ]T / 2 (n, I) , (2) 
where 



p=3 



1=1 



t M _ !2e-^e;:xp), isn=2k+i, 

h{ ) ~ \2E^ 1 Kt\ <P(P) + Y,U <P<P), Hn= 2k, 
g(n, l) = n-l + l + (n mod (I + 1)), 
h(n, I) = min(7 + 1, n — I), 



(3) 
(4) 



(5 


n 




J + 1 . 



g(n, I) - h(n, l)\ + !)-!) + h(n, I) (^^±11 _ 1^ 



(5) 

Note that the only addend of this formula growing faster than than 0(n 3 ) is 
the sum 

n 
p=3 

So, the asymptotics of the number of binary rotation words is equal to the 
asymptotics of this addend, which means that 

S(n) = ^+0( n 3 \ogn). 
The values of /(n) for some values of n are shown in the table below. 
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n 


6 


7 


10 


15 


20 


30 


50 


75 


100 


fin) 


64 


112 


504 


2804 


9442 


51306 


423814 


2222984 


7155096 


47r 2 /(n) _ 
3n 4 


0.65 


0.61 


0.66 


0.73 


0.78 


0.83 


0.89 


0.92 


0.94 



The rest of the paper is devoted to the proof of Theorem Q] and thus to a 
study of internal structure of the set of rotation words. 

3 Rotations and Sturmian words 

Denote the prefix 7*0 • • • J" n -i of length n of the word r defined in {1} by r(a, f3, 7, n) 
The parameter a is called the slope of the rotation word r. The set of all rota- 
tion words r(a, /?, 7, n) of length n is denoted by R(n), so, the searched function 
is f(n) = #i?(n). 

Lemma 1 It is sufficient to consider rotation words of slopes not greater than 
1/2: 

R{n) = {r(a,p,~/,n)\a e (0,1/2), ft 7 e K/Z.} 

Proof. Due to the symmetry, we have r(a, /3, 7, n) = r(l — a, 1 — /3, 1 — 7, n) 
if {ka} 7^ /3 or 7 for all A; = 0, . . . , n — 1, that is, if the point ka is never equal 
to the end of an interval. But if it is, we can just take r(l — a, 1 — /3, 1 — 7, n) 
and then slightly shift the interval to avoid its ends. So, slopes less than 1/2 
and greater than 1/2 give exactly the same set of all rotation words. □ 

The following lemma is a particular case of the result of Berstel and Vuillon 
[5] . We give its proof for the sake of clarity. 

Lemma 2 For any two-interval rotation word r = r(a, /3, 7, n), where a < 1/2, 
we have 

rk = rk_i + tife - Vk (6) 

for the Sturmian words u — r(a,/3,/3 + a,n) and v — r(a, 7, 7 + 0/ t/ie 

s/ope a. 

Proof. The fact that Uk = 1 is equivalent to the fact that j3 £ [{(A: — 
l)a},{fca}); the fact that Vk = 1 is equivalent to the fact that 7 6 [{(k — 
l)a}, {fca}). So, if = = 0, the interval [{(fc — l)a}, {fca}) contains neither 
/3 nor 7, and thus r/. = rfc_i; if Uk — — 1, the interval [{(fc — l)a},{fca}) 
contains both j5 and 7, and thus ru — r^-x again; if Uk = 1 and Wfe = 0, then 
[{(fc — l)a}, {ka}) contains (3 but not 7, and thus rfc_i = and r& = 1; at last, 
if M/t = and Vk — 1, then [{(fe — l)a}, {ka}) contains 7 but not /3, and thus 
rk-\ — 1 and rfe = 0. In all the four cases (j6]) holds. □ 

Note that the symbols i*o and vq are not used in the previous lemma, so, we 
see that a rotation word of length n + 1 is uniquely defined by its first symbol 
and two Sturmian words of the same slope of length n. If these two Sturmian 
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words are distinct, we can uniquely reconstruct from them the symbol r ; if 
they are equal, both rotation words 0™ +1 and l n+1 can appear. It is clear also 
that each pair of Sturmian words of the same slope gives some rotation word 
(of that slope). This gives us the next lemma: 

Lemma 3 The number of binary rotation words is bounded as 



/(n + 1) < #{(u, w)|u,« G 5t(n, a), a e(0,l/2),«^«} + 2. □ (7) 



Here St(n, a) is the set of all Sturmian words of length n and of slope a. The 
addend 2 in the formula above corresponds to all possible pairs of equal Sturmian 
words which all correspond to two rotation words, 0™ +1 and 

Denote by f pa irs (n) the number of such pairs of distinct Sturmian words of 
length n of the same slope a G (0, 1/2), so that ([7]) can be rewritten as 



Proof. Recall that the Farey series F n of order n is the increasing sequence of 
all fractions between and 1 whose denominators are at most n. The intervals 
between consecutive Farey fractions are called Farey intervals. The first Farey 
fraction is taken to be = 0/1; all the others are of the form q/p, where 
1 < q < P < n, gcd(q,p) = 1. 

It is well-known that the sets St(n,a) coincide for all a from the same Farey 
interval of order n; if the beginning of the interval is the fraction q/p, we can 
denote this set as St(n,a) = St(n,q,p). 

Let us say that a Sturmian word of length n is new in the Farey interval start- 
ing from q/p if it belongs to St(n,q,p) but does not belong to any St(n,q',p') 
for q' /p' < q/p, where q/p, q' /p' G F n . Denote the set of all new Sturmian words 
from St(n, q,p) by N(n, q,p); all other words from Stin, q,p) are called old, and 
their set is denoted by Old(n, q,p). 

As it follows directly from the results by Berstel and Pocchiola [4], for all 
q/p G F n with p > 1 we have #N(n,q,p) = n — p + 1. As a corollary, we 
immediately see that #OW(n, q,p) = p. 

Now let us count f pa ir S (n) starting from the minimal slope and going on 
along the Farey series. In the interval starting from 0, all the words are new, 
and they give n(n + 1) pairs. In any other interval, we are interested only in 
pairs where at least one of the words is new, since the pairs where both words 
are old have been counted before. So, after excluding pairs of old words, we see 
in the interval starting from q/p the following number of new Sturmian pairs: 



f(n + 1) < fpairs(n) + 2. 



(8) 



Lemma 4 For all n > 1 we have 




p=3 



n(n + 1) - #Old{n,q,p){#Old{n,q,p) - 1) 



2 



p 2 + n + p. 



= n 
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Now note that the number of Farey fractions whose denominator is p and which 
are less than 1/2 is <p(j>)/2 for all p > 2; for p = 2, the only Farey fraction is 
1/2, but we are not interested in slopes greater than 1/2; for p — 1, the case 
of = 0/1 is a bit special and has been considered in the beginning of this 
paragraph. So, summing up, we obtain that 

1 - 

f P airs(n) = n(n + 1) + - ^if{p)(n 2 - p 2 + n + p). □ 

p=3 

Together with (|8]), this lemma already gives us an upper bound for f(n+ 1). 
However, to pass to a precise formula, we should classify the cases when different 
pairs of Sturmian words give the same rotation word. 

We start from the following 

Lemma 5 If a rotation word r contains both factors 00 and 11, then it appears 
from only one pair of Sturmian words u and v of the same slope a < 1/2. They 
can be found by the equalities Uk = Vk = if Tk-i = = 1 and Uk = rk, 
Vk = Tk—i otherwise. 

Proof. Let a be the symbol whose interval is shorter than the other one and 
thus not longer than 1/2. The jump a of the moving point cannot be greater 
than the length of the longer interval, and thus, since aa appears in r, we see 
that in any pair of Sturmian words of the slope a < 1/2, generating r, the slope 
a is less than the length of the interval corresponding to a, and all the more of 
the other interval. It means exactly that we can never have Uk = Vk = 1. So, 
we can uniquely reconstruct the words u and v. if rk = rk-i, then Uk = Vk = 0, 
if rk = 1 and rk-i = 0, then Uk = 1 and Vk — 0, and if rk — and rk-i = 1, 
then Uk = and Vk = 1. This is equivalent to the statement of the lemma. □ 

So, to classify all pairs of Sturmian words of the same slope a, we must 
consider only those of them which contain consecutive occurrences of at most 
one symbol. Due to the symmetry, we can suppose for a while that this symbol 
is 0, that is, that the rotation words considered do not contain the factor 11. 

The words 0™ +1 and 1™ +1 have been already excluded from consideration 
and gave the addend 2 to the formula |U So, in what follows we consider two 
cases: either r contains only one symbol 1, that is, r = l 10"~ l for some i, 
or r contains at least two 0s and two Is. The proof will rely on properties of 
Sturmian words, so that we start from listing some of them. 

4 Properties of Sturmian words 

Denote the set of all Sturmian words of the slope a by St(a); so, St(a,n) = 
{0, l} n n St(a). As always, we say that a word u from a binary language L is 
left (right) special in L if both Ou and lu (respectively, uO and ul) are also in 
L. The mirror image u of the word u = u\ ■ • ■ u m is the word u m ■ ■ ■ u\. 

The following statements are classical and can be found in any survey on 
Sturmian words (see, e. g., [9]). 
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Claim 1 Each language St (a) contains exactly one left special word u and one 
right special word v of each length, and u = v. A shorter left special word from 
St(a) is always a prefix of a longer one; the same holds symmetrically on right 
special words and suffixes. 

Claim 2 The total number of left (right) special Sturmian words of length n is 

Another family of known facts concerns the construction of Sturmian words 
with directive sequences, standard words and central words. The facts below 
can be found in [S] . 

Consider a directive sequence (di, d%, ■ ■ ■ ), where d\ > 0, di > for all i > 1, 
and construct a sequence of words 

s_i = 1, s = 0, s n = s^l 1 s n _ 2 (« > !)■ 

The words s n obtained are called standard words. All standard words are 
Sturmian, and for each Sturmian word w there exists a directive sequence such 
that all the standard words it generates are factors of w. So, the set of factors of 
w is uniquely determined by some directive sequence, which is directly related 
to the continuous fraction expansion [0, 1 + d\, di, • ■ • } of the slope ct of w. In 
what follows we denote this slope by a<j. 

The slope ad is not greater than 1/2 if and only if di > 0. Since in this 
paper we are interested in Sturmian words whose slope is less than 1/2, from 
now on we assume for each directive sequence that di > for all i > 0. 

It can be easily checked that all standard words of length at least 2 end by 
01 or 10. For each standard word s n = c n ab, where o / 6, a,b £ {0, 1}, the 
word c„ is called a central word. In what follows we denote the word c n ba by 

C 

The following facts on standard and central words can be easily proved. 
Once again, we refer to [9] for details. 

Claim 3 All standard words from St(ad) are left special in that language. 

Claim 4 For each standard word s n £ St(ad), the word s' n is also standard and 
belongs to St{ad), but is not left special in that language. 

Claim 5 If s n = sf l r L 1 s n _ 2 , then s' n = s^ 1 s„_ 2 Sn-i ■ 

Claim 6 Central words c n are bispecial in St(ad). 

Claim 7 A word c is bispecial in St(ad) if and only if it is obtained by deleting 
two last symbols from some word s^_ 1 s„_2, where < t < d n . We shall denote 
this word by c n j; in particular, c n _d n = c n . 

Claim 8 The total number of central words of a length n is ip(n + 2). 

Claim 9 The last two symbols of standard words alternate: i/s„_i ends by 01, 
then s n ends by 10, and vice versa. 



6 



For each directive sequence (di, d,2, ■ ■ .)> we denote the length of the nth 
standard word s n by l n . 

The following lemma can be proved analogously to Theorem 2.2.31 from [5]. 

Lemma 6 For all n > 1, if the standard word s n in the language St(ad) ends 
by a symbol a, then the word asf" +1+2 is not a factor of St(ctd). 

5 The case of unique 1 

Let r be a rotation word of the form l 10™~\ Clearly, if < i < n, this 
word is generated by all pairs of Sturmian words of the form (slOi, sOlt) with 
sl0i,s01i € S(n,a) for some slope a, where |s| = i — 1, \t\ = n — i — 1. In 
particular, s is a right special word in St(i — 1, a), and t is a left special word in 
St(n — i— I, a). Due to Claim [TJ the shorter of the words s and t is determined 
by the longer one. 

Suppose first that i > n — i; then s is longer than t and determines all the 
word slOt. So, the number of pairs of Sturmian words giving l 10 n ~ l is equal 
to the number of (right) special Sturmian words of length i — 1, that is, to 
Sp=i v(p) ( see Claim [2]). Symmetrically, if i < n — i, then this is t that defines 
all the word slOt, and the number of such pairs is equal to the number of (left) 
special Sturmian words of length n — i — 1, that is, to J2pZl <P(p)- 

Now if i = and the rotation word is 10™, the pairs of Sturmian words 
generating it are all pairs of the form 0s, Is. So, s is a special Sturmian word of 
length n — 1, and the number of such words is J2p=i <P(p)- Symmetrically, the 
number of pairs of Sturmian words generating the word 01™ is also the same. 

Summing up all the numbers above, we see that the n+1 rotation words of 
the form l 10™ - \ where < i < n, are generated by the following number of 
pairs of Sturmian words of length n: 



Starting with n = 3, the sums in fi(n) involve only special words of length 
at least 1. Exactly a half of them, namely, the left special words starting with 
and symmetrically the right special words ending with 0, correspond to slopes 
less than 1/2. So, the n + 1 rotation words of the form z 10 n_I are generated 
by /i(n)/2 pairs of Sturmian words. 

Exactly the same total number of pairs (in fact, the pairs (v, u), where (it, v) 
are the pairs considered above) generate the n+1 rotation words of the form 
F01 n- \ Starting from n — 3, it gives us exactly /i(n) pairs generating 2(n + 1) 
rotation words. 

6 The case of several Is 

Most of technical details of our result are hidden in the following 
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Theorem 2 Suppose a rotation word w is generated by at least two different 
pairs of Sturmian words of slope less than 1/2, and w contains at least two Is 
and at least two Os. Then w = O^IO'^IO 3 ' or w = V(01 l ) k 0V for some i,j > 0, 
I > 2, k > 1, and the number of pairs generating w is equal to ip(l + l)/2 if 
an d ipQ + 1) otherwise. 

This section is devoted to its proof which is based on the theory of standard 
Sturmian words and their construction with directive sequences (see Section [4]). 

Lemma 7 Suppose that a rotation word r contains a factor 010 fel 10 fc2 l, where 
k.2 > k\ > 0, and is generated by ap pair (u, v) of Sturmian words from some 
St(ad)- Then k\ = l n — 1 for some n and (&2 + 1) mod l n = l n —i- 

Proof. Clearly, the pair (it, v) generating r contains some factors v! — IOU1IOU2I 
and v' = 01ui01«20 for some U\^ui1 with \u-\\ = k\ — 1 and \u2\ — k2 — 1. The 
words u\ and U2 are bispecial in St(ctd)- So, u\ — c n _d for some n > and 
< d < d n . Note also that 1*2 is left special in St (a a) longer than s„_i, so 
it starts from s„_i since there is only one left special word of each length in 
St{a d ). 

Without loss of generality, suppose that s„_2 ends by 10; if it ends by 01, 
in all the arguments below we should just consider v' instead of u' . 

Suppose that d < d n ; then v! = 10s„_ 1 s n _2S n _iu"; in particular, it means 
that the word s^_ 1 s„_2S„_i can be extended to the left by 0. On the other 
hand, since d < d n , the same word can clearly be extended to the left by s n _i, 
and thus by its last symbol 1. We see that s^_ 1 s, i _2Sn-i is left special; but it 
is not possible since s„_2S„_i differs in two last symbols from s n _is n _2, and 
thus s^_ 1 s„_2S„_i is not equal to the prefix s%t\s n -2 of s n of the same length. 
Since s^l\s„_ 2 is the only left special word of its length in St(a d ), we see that 
0s^_ 1 s n _2 s 7i-i and thus vf are not in St(ptd), a contradiction. So, the case of 
d < d n is not possible, and thus u\ — c n for some n, and k\ = l n — 1. 

Now recall that 112 is longer than u\\ so, it is equal to cn,d for some N > n 
and < D < djv- Suppose that N > n + 1; then 1*2 starts with s n +\ and 
v! = 10s n s n +iu". As above, the word s n s n +i can be extended to the left by 1, 
which is the last symbol of s n +i, and it is not left special since it is not equal 
to the only special word s n +is n of the same length, so, 0s n s n +i and thus u are 
not elements of Stfptd)- 

So, iV = n+1, U2OI = s„s n _i for some d < c?„+i, and | W2 1 +2 mod l n = l n -i, 
which was to be proved. □ 

Lemma 8 Consider two slopes ad, ctd' < 1/2 with corresponding directive se- 
quences (g?i, g?2j • . .) and (d' 1; df 2 , ■ ■ ■) and respective lengths l n and l' n of standard 
words. If l n — l' m and l n -\ = l' m -i for some m and n, then n = m and di = d[ 
for all i = 1, . . . n. 

Proof. By the construction, for all i we have 1^2 = h mod Zj_i and di = [k : 

Z i 1 J . So, starting from l n = l' m and l n -i — l' m -i, we can uniquely reconstruct 

ln-2 = l' m -2i = e ^ c - Note that here di,d[ > since both slopes are 
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less than 1/2. So, as soon as we get l n -k = l' m -k = L we immediately see that 
n = m = k and [d\, . . . , d n ) = . . . , d' n ). □ 

Lemma 9 All rotation words arising from several pairs of Sturmian words 
of slope at most 1/2 and not containing two consecutive Is are of the form 
i (10') fc 10 J ' for some i,j > 0, k > 1, I > 2. 

Proof. Let us prove that a word arising from two pairs of Sturmian words, 
of slopes ad ^ aid', cannot contain a factor w — 10 mi 10™ 2 l with mi / m^- 
Suppose it contains it. The proof is carried over for ni2 > mi; the opposite 
case can be proved by the argument that the set of Sturmian words, the set of 
rotation words and the procedure generating a rotation word from two Sturmian 
words are symmetric under taking the mirror image. 

First suppose that Xw is also a rotation word. Due to Lemma [5l Iw appears 
from only one pair of Sturmian words, and this pair is (0 mi+1 10" 12 1, 010 mi 10 1 " 2 ). 
These two words are Sturmian of the same slope only if m,2 = mi + 1. But in 
this case, the word Ow = 010 mi 10 mi+1 l due to Lemma [JJ is also generated by 
only one pair, namely, by the pair (10 mi 10 mi+1 l, 010 1 ™ 1 10™ 2 ). So, w arises only 
from the pair (0 mi 10 mi+1 l, 10 mi 10 m2 ), contradicting to our assumption. 

So, if w arises from several pairs of Sturmian words of slope at most 1/2, 
then so does Ow. Since we suppose that W2 > mi, we can apply Lemma [71 
according to which mi = l n — 1 and mi mod mi = l n —i — 1, where U are the 
lengths coming from the directive sequence for the language of the Sturmian 
words involved. Due to Lemma [51 the values of l n and l n -i uniquely determine 
the sequence (di, d-2, . . . ,d n ), its length n and thus the central words Ui and U2 
such that Ow = OU1IOU2I and v = IU1OIU2O. So, we see that once again, Ow 
and thus w arise from only one pair of Sturmian words of slope less than 1/2, a 
contradiction. 

We have proved that our word is of the form I (10 z ) fc 10 : ' for some i,j > 0, 
k > 1 (since the case of a unique 1 is considered separately), and I > 0. It 
remains to consider the case of I = 1 and to see that the word I (10) fc 10 : ' is 
generated by only one pair of Sturmian words of slope less than 1/2 defined 
as follows: the central part (10) fc l is given by the pair u = 0(10) fc_1 l, v = 
l(01) fe_1 0, and the prefix and suffix zeros correspond to the common prefix 
■ ■ ■ 1010 and the common suffix 0101 • • • of the generating Sturmian words. □ 

Lemma 10 Each word w = l (10') fc 10- 7 with i,j<l,l>2,k>lis generated 
by ip (I + l)/2 different pairs of Sturmian words of the same slope not greater 
than 1/2. 

Proof. Suppose for simplicity that i, j > and consider a pair (u, v) such that 
w = r(u,v). Clearly, (u,v) = (slOcilO--- , 10c fe 10p, sOlciOl • • • ,01c fc 01j>) for 
some central words ci , . . . , Cfc of length I — 1 and some words s , p with \s\ = i — 
\p\ = j — 1. Since each Sturmian language St(a) contains at most one central 
word of length i — we have ci = c% = ■ ■ ■ = Ck = c. Moreover, the word p is 
left special, and so it is a prefix of c, and the word s is right special, so it is a 
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suffix of c. So, the pair (u, v) is uniquely determined by the central word c and 
the parameter i. 

There exists ip(l + 1) central words of length I — 1, and a half of them 
correspond to slopes less than 1/2. So, it remains to prove that for each central 
word c and each power k two words u = s(10c) fe 10p and v = s(0lc) k p, where s 
is a suffix and p is a prefix of c, appear in some Sturmian language of a given 
slope. 

Indeed, let clO be equal to the standard word s n — s^Li^n—v i n some lan- 
guage St{ad)- Then u is a factor of s k+2 . At the same time, cOl = s' n — 
s n-i ls »-2 s n-i) so that v is a factor of (s') k+2 = s n-i ls n-2 s n +ls n-i> which is 
in its turn a factor of s k+3 . So, taking d n+ i > k + 3, we see that u,v E St (ad) 
for the directive sequence d = (di, . . . , d n , fc + 3, . . .), which was to be proved. 

If i = 0, or j = 0, or/and the standard word s n is equal to cOl, not to clO, 
the proof is carried on similarly. □ 

Lemma 11 Each word w — l (10 i ) fc 10 J with i > I or j > I, I > 2, k > 1 is 
generated by ip(l + 1) different pairs of Sturmian words of the same slope not 
greater than 1/2. 

Proof. As above, if w = r(u,v), then u = s(10c) k 10p and v = s(01c) k 01p 
for some central word c of length / — 1 and some words s,p with \s\ = i — 1, 
\p\ = j — 1. We have a choice which of the words cOl and clO is a standard word 
in the Sturmian language considered; suppose it is clO = s n . Suppose also that 
j > I. Clearly, p is left special and thus is a prefix of some standard word sn, 
N > n. Suppose that p is not a prefix of some power of s n : it means that p 
contains as a prefix the word s n n+1 s n -is n , or, more precisely, the word obtained 
from it by erasing the last symbol, since s n -\s n differs from s n s n _i, which is a 
prefix of s^, by the two last symbols (see Claim [5]). In particular, p starts by 
s^' 1+1+1 , and thus the suffix OclOp = 0s n p of u starts with 0sn' +1+2 , which is 
not an element of St(ctd) due to Lemma [6) a contradiction. So, p is a prefix of 
s„ for some d. Symmetrically, if i > I, then s is a suffix of the mirror image 
of s„ for some d; by the way, this mirror image is equal to 01(c01) d ~ 1 c. For 
s n = cOl, we should just consider v instead of u to prove the similar statements. 

Note that since one of the parameters i or j is indeed greater than Z, the 
cases of s n = clO and of s n = cOl are really different, which gives us if (I + 1) 
cases: the total number of standard words of length I + 1 is twice bigger than 
the number of central words of length I — 1, that is, is equal to 2<p(l + 1), but 
we are interested only in those of slope less than 1/2. 

It remains to mention that for each standard word s = cab of length I + 1, 
where a, b G {0, 1} a ^ b, and for all d', d", k > 0, the words v! = (cba) d ' (cab) k+d " 
and v 1 = (cba) d +k (cab) d , so that w is a factor of r(u', v'), are factors of some 
language St(ctd). Indeed, let s — s n for the directive sequence (di, . . . , d n ); then 
s = cab = and cba = s n r ^ 1 s n _ 2 s„_i, so that u' = (sf"^ 1 s n ^ 2 s n -i) d ' (s^l 1 s„_ 2 ) 

s^Si Sn^si' 1 s n -is k l +d '\ andu' = s^ 1 s n ^ 2 s d ^ +k ^ 1 s n ^ lS f . Ifwetaked„+i = 
D = max{d' + k,d" + k}, we see that both u' and v' are factors of S® Sn-is® . 
So, both v! and v' , and thus the pair of words based on the standard word s 
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of length I + 1 and giving the rotation word w, are elements of the language 
St(ad) for the directive sequence d = [d\, . . . , d n , D + 2, . . .), which completes 
the proof of the lemma. □ 
This lemma, in its turn, completes the proof of Theorem [5J 



7 Final computations 

To find the precise formula for f(n + 1) for n > 3, we should subtract from the 
bound f P airs(n) + 2, where f pa irs{n) is found in Lemma 21 the number of pairs 
generating rotation words already obtained before. 

As it was shown in Section [SJ the 2(n + 1) rotation words with only one 
symbol 1 or only one are generated by fi(n) pairs (see (j9])). 

Now let us take into account the rotation words containing several Os and 
several Is. Their form is described in Theorem [5J Consider all rotation words 
of length n + 1 of the form 0*(10 ) 10 3 , such that I and i are fixed and k and j 
are not. They are \_(n — + 1)J ; and taking all words with a given I together, 
we see that they are 



E 

i=0 



n — i 


1 


n 


J+l. 


~ 2 


J + 



(n - I + 1 + [n mod (I + 1))). 



In what follows, to make the formulas shorter, we will denote 



](n,l) 



1 + 1 + (n mod (I + 1)), 



so that the words of the form 1 (10 i ) fc 10 J are g(".')h/('+i)J _ 

Each of these words is generated by tp(l + 1) pairs of Sturmian words, except 
for the min(Z + l,n — I) words with i,j<l which are generated by ip(l + l)/2 
pairs each. So, for each I > 2 we should subtract from the sum the following 
function: 



MM 



(i 


n 




i + 1 



g(n, /) - min(/ + 1, n - l) \ ((p(l+l)-l)+min(l+l, n-l) ^ 



The same function /2(n, Z) corresponds to the words of the form P(01 ) OlA 
So, to take into account all rotation words arising from several pairs and con- 
taining at least two Os and at least two Is, we should subtract from the upper 
bound the sum 2 Ym=2 foiji, I). 

Summarizing the above arguments, we see that 

n-l 

f(n + 1) = f pairs (n) +2-f 1 (n)+2{n + l)-2j2 / 2 (n, 0- 

1=2 

This is exactly the statement of Theorem [T] which was to be proved. □ 
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